Classifying offensive sites based on image content

Authors

  • Will Archer Arentz
  • Bjørn Olstad
Abstract

This paper proposes a method for helping to identify adult web sites by using image content as a means of detecting erotic material. The image content is classified by investigating probable skin regions and extracting their feature vectors. These feature vectors are based on color, texture, contour, placement, and relative size information for a given region. The importance of the different elements in the feature vector is determined by a genetic algorithm. For each picture, the algorithm gives the probability that the picture has erotic content. By mapping all the images in a web site and running the image-based classifier on the whole collection, we were able to set up a histogram of images with regard to the log-likelihood of erotic content for each image. This gives a good overview of the web site's content while leaving room for errors in the image-based classifier. The algorithm proved quite successful in our tests, where all 20 sites were classified correctly. The image-based classifier properly identifies 89% of the evaluation images at an average processing speed of 11 images per second. Although this experiment focused on classifying adult web sites, small alterations to the system would enable classification of other kinds of images and web sites.
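The site-level step described in the abstract (scoring every image, then histogramming the scores for the whole site) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the use of log-odds as the per-image score, the bin width, and both decision thresholds are assumptions chosen for illustration, and the per-image probabilities are taken as given from some image classifier.

```python
import math
from collections import Counter


def log_odds(p, eps=1e-6):
    """Map a probability to log-odds, clamped away from 0 and 1.

    Using log-odds as the per-image score is an assumption of this sketch.
    """
    p = min(max(p, eps), 1.0 - eps)
    return math.log(p / (1.0 - p))


def site_histogram(probabilities, bin_width=1.0):
    """Count images per log-odds bin; keys are the lower bin edges."""
    counts = Counter()
    for p in probabilities:
        edge = math.floor(log_odds(p) / bin_width) * bin_width
        counts[edge] += 1
    return dict(sorted(counts.items()))


def classify_site(probabilities, score_threshold=0.0, min_fraction=0.3):
    """Flag a site when enough of its images score above the threshold.

    Both thresholds are hypothetical values, not figures from the paper.
    """
    if not probabilities:
        return False
    flagged = sum(1 for p in probabilities if log_odds(p) > score_threshold)
    return flagged / len(probabilities) >= min_fraction


if __name__ == "__main__":
    # Hypothetical per-image probabilities for one site.
    probs = [0.9, 0.8, 0.1, 0.95, 0.2]
    print(site_histogram(probs))
    print(classify_site(probs))
```

Deciding on the distribution of scores rather than on any single image is what lets a site-level verdict tolerate individual misclassifications by the image-based classifier.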

Similar resources

Statistical Classification of Image Content for Visual Information Filtering

An increasing number of freely accessible adult content websites arose recently, displaying a wide variety of different offensive images and videos. Since many users do not want to be confronted with such material, automatic tools to detect and filter these images and videos are needed. Additionally, tools are required to protect children from accessing offensive websites. This thesis presents ...

A New Effective System for Filtering Pornography Images from Web Pages and PDF Files

One of the more rapidly growing areas in search technology is image search. With this availability comes the natural need to filter offensive content, to prevent pornographic images from reaching the wrong eyes. Filtering and blocking software is one of the most frequently touted prevention devices. As any user of these services is aware, they often fail to remove offensive images. The reasons a...

Image retrieval using the combination of text-based and content-based algorithms

Image retrieval is an important research field which has received great attention in recent decades. In this paper, we present an approach to image retrieval based on the combination of text-based and content-based features. Keywords are used as text-based features, and color and texture as content-based features. A query in this system contains some keywords and an input...

Pornographic Image Filtering Using Skin Recognition Methods

In this paper, we describe various skin detection and image filtering methods, and present a comprehensive comparative study of the methods proposed to detect adult images. The work is based on computer vision algorithms and pattern recognition techniques. First the images are changed to identify areas with low color intensity by the color model. In the next part of the proposed ...

Publication date: 2003